Reinforced walks in two and three dimensions 
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In probability theory, reinforced walks are random walks on a lattice (or more generally a graph) 
that preferentially revisit neighboring 'locations' (sites or bonds) that have been visited before. In 
this paper, we consider walks with one-step reinforcement, where one preferentially revisits locations 
irrespective of the number of visits. Previous numerical simulations [A. Ordemann et al, Phys. Rev. 
E 64, 046117 (2001)] suggested that the site model on the lattice shows a phase transition at finite 
reinforcement between a random-walk like and a collapsed phase, in both 2 and 3 dimensions. 
The very different mathematical structure of bond and site models might also suggest different 
phenomenology (critical properties, etc.). We use high statistics simulations and heuristic arguments 
to suggest that site and bond reinforcement are in the same universality class, and that the purported 
phase transition in 2 dimensions actually occurs at zero coupling constant. We also show that a 
quasi-static approximation predicts the large time scaling of the end-to-end distance in the collapsed 
phase of both site and bond reinforcement models, in excellent agreement with simulation results. 

PACS numbers: 



I. INTRODUCTION 



A. Background: Physics 



Random walks with memory have a large number of 
applications in physics and other sciences. Many variants 
have thus been studied in different contexts. The best 
known example is presumably the self-avoiding walk [l|, 
which models the large scale behavior of flexible chain 
polymers in good solvents. As pointed out by Amit 
et al. the name 'self-avoiding walk' is something 
of a misnomer, since this model describes either static 
self-avoiding chains or self-killing walks. When the self- 
avoiding walker tries to revisit a site it has visited before, 
it is not gently turned towards another neighboring site; 
it is killed. In a more general version of this model, walk- 
ers carry an initial weight of unity, which decreases by a 
fixed factor whenever a site is revisited (the Domb- Joyce 
model [1]). If a site i had been visited n times before, 
the weight is diminished at the n + 1 visit by e"" with 
M < 0. 

When the sign of the interaction is changed to u > 0, 
so that the weight is multiplied by a factor e"" > 1 at 
each revisit, the resulting self- attracting walk degenerates 
in any finite dimension; for large times, the walker just 
oscillates between two sites. This extreme behavior is 
avoided if the weight change is independent of n and one 
distinguishes only between sites which have and have not 
been visited before. This model is related to the Donsker- 
Varadhan [1] "Wiener sausage" problem Q and leads 
to a power law scaling Rt ~ for the end-to-end 

distance after t time steps in d dimensions of space. 

In contrast to these "static" models, where instances 
are weighted and the weights are modified by interac- 
tions, one can define "dynamic" models where the walks 
are biased by the interaction. The oldest such model is 
the true self-avoiding walk (TSAW) of Amit et al. Q. 



Assume that at time t the walker is at site i, and that 
the number of previous visits to any of the Af neighbors 
is Uj, J ~ 1, A/". Then the probability to step to neigh- 
bor j at the next time is 



Pj 



u < . 



(1) 



This is a much milder modification than the original 
self-avoiding walk. Accordingly, the r.m.s. end-to-end 
distance scales as ^ t for d > 2, while there are 
logarithmic corrections at the 'upper critical dimension' 
d ^ dc ^ 2. In contrast, the upper critical dimension for 



the self-avoiding walk is dc 
with < 1/2 



4, and ^ t^'^d foj. d<dc 



B. True self-attracting walks 

When the sign of u is switched to positive, the resulting 
"true self-attracting walks" (TSATWs) are also closer to 
random walks than the ordinary self-attracting walks. It 
seems that the behavior of the TSATW with pj given by 
Eq.([T]) but with u > is unknown. On the other hand, 
there are several numerical studies of the TSATW with 
one-step reinforcement [1, 0, H, S [13, [HI 



Pi 



M > . 



(2) 



where Kj = if the site j has never been visited be- 
fore, and Kj = 1 otherwise. By far the most extensive 
studies were those of [13, [HI, which claimed that one- 
step reinforcement TSATWs on the lattice showed a non- 
trivial phase transition in both d = 2 and d = 3, with 
Uc{d = 2)= 0.88 ± 0.05 and Uc{d = 3) = 1.92 ± 0.03. In 
both cases, the behavior of R^ is supposed to change at 
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Uc from t at u < Uc to 

32 



(3) 



at M > Uc- Hence the phase transition is between a 
random-walk-Hke and a collapsed phase. At the criti- 
cal point, Rt scales with a new exponent Vr. which is 
0.40±0.01 in d = 2 and 0.303±0.005 md = 3^. These 
phase transitions are also seen in the average number (St) 
of sites visited up to time t. This scales as (St) ^ t for 
u < Uc (with a logarithmic correction for d ~ 2), but as 
t'^ with 



k = d/{d+l) 



(4) 



for u > Uc- The latter was derived from a quasi-static 
approximation in Ref. Q. The quasi-static approx- 
imation seems to be satisfied to high precision (see be- 
low). Aiu~ Uc, Ordemann et al. found = 0.80±0.01 
(d = 2) resp. 0.91 ± 0.01 {d = 3) 



C. Background: Mathematics 

In a parallel and largely independent development, 
these and similar random walks with memory have been 
extensively studied in the probability theory literature. 
For a recent survey, see [l3|. The rigorous mathemati- 
cal study of reinforced walks displays much more breadth 
than the rather limited study of one-step site reinforce- 
ment in the statistical physics literature. In contrast to 
the physics literature, which focuses on the site model, 
bond or "edge" reinforced random walks (ERRW) have 
been studied in great detail and with multiple reinforce- 
ment as in Eq.([T]) with positive u. Such walks (most 
clearly on trees) are closely related to Polya urn pro- 
cesses and similar problems with reinforcement that can 
be solved exactly (note that walks with bond reinforce- 
ment are often called 'trails' in the physics literature 
[iBl)- For the models with multiple site reinforcement 
discussed above (called vcrtcx-rcinforccd random walks 
or VRRW), the related urn process is Friedman-like and 
less tractable [H, [13] , see endnote [ll] . 

The mathematicians have discovered profound differ- 
ences between these two models. For example, ERRW 
is recurrent on finite graphs [H, [13], meaning that ev- 
ery edge is traversed infinitely often, while VRRW is 
not, becoming trapped e.g. on a line of five vertices 



13 



14, 16 I or more generally on "trapping subgraphs" 
1^ . [m [20| . Many properties of the ERRW remain 
unknown; for example, the recurrence of ERRW on the 
infinite 2-d lattice is an open question. Even the one-step 
ERRW model (called once-reinforced in the mathemat- 
ics literature) has only been successfully studied on a few 
special graphs, e.g. the infinite regular or Galton- Watson 
tree (where it is transient fl3 . [2l] l22j |l or the infinite lad- 



der (where it is recurrent 13l . 23| |) . The recurrence of one- 
step ERRW on the infinite 2-d lattice remains essentially 
open, although Scllkc showed the separate recurrence of 



each coordinate [H, Pcmantle [H, [T3| provides a 

more complete description of these and other results. 

The difference in mathematical tractability and under- 
lying structure might suggest that models with bond and 
site reinforcement show different phenomenology. But 
this result would be unexpected from considerations of 
universality in statistical physics. 



D. Overview of Results 

In the present paper, we clarify some of these issues by 
means of high precision simulations. Our main results 
are: 

• Bond and site reinforced TSATWs with one-step 
reinforcement show the same critical behavior and 
are likely in the same universality class; 

• There is no finite u phase transition in the 2- 
dimcnsional TSATW model with one-step rein- 
forcement. Walks are in the collapsed phase for all 
u > and the phase transition happens at Uc ~ 0', 

• The critical point and the critical exponents for 
TSATWs with one-step reinforcement in d = 3 
are markedly different from the values obtained in 
0,[ni; and 

• The quasistatic approximation for the end-to-end 
distance seems to become exact as <: — > oo for the 
collapsed phase. 



II. NUMERICAL METHODS AND RESULTS 
A. Methods 

Simulations of TSATWs with one-step reinforcement 
arc straightforward. To keep track of previous visits, one 
has to store a one-bit "spin" variable for each site 
(bond) i, and clear all spins after each walk. For conve- 
nience we sometimes used one byte per spin, which has 
the added advantage that clearing is needed only after 
every 255th walk. This requires resp. L"^ bytes of 

memory for site TSATWs and dL'^/8 resp dL'^ bytes for 
bond TSATWs. Memory limitations were more severe 
than CPU time so in the following we show more detailed 
results for site TSATWs than for bond TSATWs. 

The most serious potential source of systematic errors 
arises from lattices that arc too small. If open boundary 
conditions (b.c.) are used, the walk cannot go beyond 
the boundary, and both Rt and (St) are underestimated. 
If periodic b.c. are used and the walk wraps around the 
lattice, it finds visited terrain in front of it and Rt is over- 
estimated, while (St) is still underestimated. We used 
lattices with helical boundary conditions and with up to 
N = 2^2 sites (d = 2) resp. 2^4 sites (d = 3), see end- 
note [29| . For each walk, the spans a;niax — a^min in all 
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FIG. 1: (color online) Statistical errors (1 st.d.) of the three 
estimators for the average squared end-to-end distance di- 
vided by t (direct, R^/t, and [Rt]opt/t from top to bottom) 
for 2-d site TSATWs with u = 0.34. The number of walks 
in this sample was 10*, which is only a small fraction of our 
total sample. 



FIG. 2: (color online) Average squared end-to-end distances 
for site TSATWs in 3 dimensions, divided by the number t 
of steps. Each curve corresponds to a fixed value of u, with 
u increasing from top to bottom. Here and in all subsequent 
figures, the curves are labelled hy w — exp(u). The critical 
value u = Uc corresponds to a straight curve in the limit 
i — > 00 whose slope is 2vc—^- Statistical errors are comparable 
to the thickness of the curves. 



d directions were measured, and it was checked that the 
fraction of walks where any span was > L did not exceed 
10"''. This restricted the number of steps per walk to 
imax < 10^ for d = 2, and to t^ax « 4 x lO"^ for d = 3. 
The total number of walks for each parameter setting was 
typically « 2 x lO'' to « 2 x 10^ 

B. Variance reduction 



with at fixed for each t such that the variance of [i?f]opt 
is minimal. Differentiating the variance of [i?j]opt with 
respect to at and minimizing this variance requires the 

estimation of the variances of i?^ and as well as their 
covariance. In Fig. [T] we show the errors (single standard 
deviations, divided by t) of the three estimators for 2-d 
site TSATWs with u = 0.34. 



For small u, where walk-to- walk variation is significant, 
substantially increased accuracy is obtained by the fol- 
lowing variance reduction procedure. Assume that the 
walker has already made t steps and is presently at a site 
with cartesian coordinates x^. Given Xj and the states of 
the neighboring sites (i.e., visited or unvisited), one can 
calculate the expected increment Axt^^i for the next step, 
since one knows the probability for the walker to step in 
each direction. From this, one obtains an estimate for 
the increment of 

aS^i = [x2:~x2] = 2xt • Axt+1 + 1 . (5) 

where we have used the fact that Axt+i • Axt+i = 1. 
The improved estimate is obtained by summing these 
increments, 

t 

Rl = Y.^t'- (6) 

t'=i 

Further improvement is obtained by taking the optimal 
linear combination of the direct sample average and this 
estimator, 



C. Site TSATWs in d = 3 

The average r.m.s end-to-end distance divided by the 
number of steps, t~^R^, is shown in Fig. [2] for site 
TSATWs in d = 3. In this and in all subsequent figures, 
curves are not labelled by u but by w = exp(M). We see 
clearly that there are significant corrections to scaling (all 
curves bend upward at small t), but they are no worse 
than in other nonequilibrium critical phenomena. A more 
careful analysis, taking these corrections into account, 
gives Uc = 1.831 ± 0.002 {wc = exp(Mc) = 6.24 ± 0.01) 
and Vc = 0.378 ± 0.004. In particular, we can rule out 
the possibility that Uc > 1.85 from the simple fact that 
all curves for u > 1.85 (i.e. for w > 6.35) are clearly 
S-shaped and curve down at large t. These estimates 
are incompatible with those of [11|, Uc ~ 1.92 ± 0.03 and 
i/c = 0.303±0.005. Possible explanations for these earlier 
results are that corrections to scaling were neglected in 
or that the lattices used were too small. 

The cross-over behavior near u k can be fitted to 
the usual ansatz 



[RX^t = atRi + (1 - at)R'i , 



(7) 



i?2 ^ f^-FKu - Uc)t'^] + corrections 



(8) 
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FIG. 3: (color online) The same data as in Fig. (2] plotted 
now as R^/t^'^" versus {u — Uc)t'^ , with Vc ~ 0.378, Uc = 1.831 
(wc = 6.24), and (j) = 0.185. If there were perfect scaling, all 
data would fall on a single curve. 
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FIG. 4: (color online) Average number of visited sites for 
site TSATWs in 3 dimensions, divided by the number t of 
steps. Curves correspond to the same values of u as in Fig. [2] 
Statistical errors are much smaller than the thickness of the 



with (j) ~ 0.185 ± 0.020, as seen from the data collapse 
shown in Fig. [3] The deviations from a perfect collapse 
seen in this figure are due to the corrections to scaling 
at small t seen in Fig. ^ which are not included in the 
scaling ansatz Eq. ^ . The apparent collapse could have 
improved by the widespread practice of plotting the sub- 
and supercritical branches separately, without demand- 
ing that they join smoothly (the function F{z) must be 
analytic at z = 0). But the results obtained in this way 
would be spurious. 

Results for the average number of visited sites, {St), 
again divided by t, are shown in Fig. 21 This time the 
corrections to scaling are much bigger. This is not un- 
expected, since there arc also large corrections to the 
asymptotic law {St) ^ t for ordinary 3-d random walks. 
These corrections make an independent estimate of Uc 
impossible, whence we shall use the estimate obtained 
from i.e. Uc = 1.831 ± 0.002. The corrections 
to scaling also make the estimation of the exponent 
kc very uncertain, in spite of the extremely small sta- 
tistical errors (much smaller than the thickness of the 
curves). Our best estimate is fc^ = 0.977 ± 0.010. This 
is again incompatible with the estimate 0.91 ± 0.01 of 
[ll|. The leading correction to scaling exponent, de- 
fined as {St) = t^-^ia + b/t^ + o{t-^)], is found to be 
A = 0.22 ± 0.03. This is to be compared to A = 1/2 for 
ordinary 3-d walks [2^. 

For the supercritical case, u > Uc, the following argu- 
ment was given in Q : Let us assume that the visited sites 
form, for large t, a compact c?— dimensional domain Vt 
whose volume increases as St = \Vt \ on Rf with Rt ^ f^. 
Its surface is fuzzy but not fractal, i.e. it increases as 
\dVt\ oc If the walker is uniformly distributed in- 

side Vt, then the chance for it to be at the boundary is 
\dVt\/St oc 1/Rt- This is then also proportional to the 
chance that the walker will make the next step outside Vt, 



i.e. d{S)/dt ~ t^'^ . Integrating this gives = l/{d+ 1) 
0. The main assurnption here is not that dVt is non- 
fractal (as stated in [a, El), but that the walker is uni- 
formly distributed inside Vj. This would be exact if the 
boundary would not grow at all (i.e. in the limit u — > oo), 
but for finite u it corresponds to a quasistatic approxima- 
tion in the sense of [l^l . 

In order to test this quasistatic approximation of the 
supercritical behavior, we plot in Fig. [5] the ratio Rt/Vt 
for several values of u > Uc. We see very large corrections 
to scaling (the corrections to {St) would be even larger), 
but the curves do seem to become horizontal for t oo. 
For u > 2.5, our best estimate is Rt ~ with i' = 
0.25 ± 0.01, in perfect agreement with the prediction of 
the quasistatic approximation. 



D. Site TSATWs in d = 2 

In two dimensions the situation seems at first glance 
similar, except for the fact that corrections to scaling are 
even larger. The latter is not surprising: random walks 
are recurrent in d = 2, while they arc not in any d > 2. 
The number of visited sites increases not as t in d = 2, but 
as St = 7rt/ln(8t)[l 4- (9(1/ In t)] \2^. Related to this is 
the fact that true self avoiding walks have upper critical 
dimension c? = 2, leading to logarithmic corrections in 
most observables for d = 2. As a consequence, one should 
also expect logarithmic corrections for TSATWs. 

Results for the end-to-end distance are shown in Fig.[Bl 
Again we show a log-log plot of Rt / 1, for easy comparison 
with Fig.[2l The main difference between these two plots 
is that the curves fan out in Fig. [S] already for very small 
t, while they fan out only at much later times in Fig. [21 
While the curves for u < in Fig. [51 first seem to follow 
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FIG. 5: (color online) R^t/sft for site TSATWs in 3 dimen- 
sions, for five values of u which are all larger than Uc. The 
curves seem to become horizontal ast^oo, with the asymp- 
totic behavior appearing for smaller values of t as ?i increases. 
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FIG. 6: (color online) Same as Fig. (2] but for d = 2. 



the scaling Rt ^ f^" and cross over to Rt ^ t only at large 
t, no such cross-over is seen in Fig. [S) Careful inspection 
shows that all curves for u > 0.58 (i.e. e" > 1.79) bend 
down at large t, indicating that < 0.58 and that the 
estimate Uc = 0.88 ± 0.05 of [l^l is untenable. If we want 
to see a critical point with an associated non-trivial power 
law in these data, then a possible candidate is Uc ~ 0.54 
and i/c ~ 0.47. 

An attempted data collapse for the data of Fig. [HI again 
using Eq.([8]) and optimized values Uc = 0.548, i^c — 0.475, 
and (f) = 0.085, is shown in Fi g. W\ We might mention 
that the exponents proposed in [11|, Vc = 0.40 ±0.01 and 
(j> « 0.2, seem to be ruled out. A data collapse using these 
exponents is shown in panel (b) of Fig. [T] Although it has 
an acceptable overall dispersion, this is achieved mainly 
by fitting well the small-i data, and grossly misrepresent- 
ing data for large t. 
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FIG. 7: (color online) Attempted data collapse analogous to 
Fig. |3l but for the data of Fig. [6] Parameters in panel (a) 
are ty^ = 0.475, = 0.548 (wc = 1.730), and cj> = 0.085, 
while panel (b) uses the values Vc = 0.40, Uc = 0.88 {wc = 
2.411), = 0.2 proposed in [ll|- 
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FIG. 8: (color online) The function *t(u) [see Eq. ^] plotted 
against t for both positive and negative values of u, including 
u = (w = 1). The fact that this figure resembles a typical 
cross-over plot as in Fig. [2] suggests that u = is a critical 
point. 
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FIG. 9: (color online) Part of the data shown in Fig. [S] but 
plotted against l/t"'"^"^ . This exponent gave the straightest 
curves for w — exp{u) < 0.5. No exponent would give straight 
curves for w > 0.8. 
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FIG. 10: (color online) Same as Fig. H but for d = 2. The 
curves correspond to ui = 0.2, 0.4, 0.6, . . . 2.6 (from top to bot- 
tom). 



Although the collapse seen in Fig. [Tl^a) is satisfactory, 
the smallness of 4> and the closeness of i^c to the random 
walk value v = 1/2 suggest a very different interpreta- 
tion. We propose that there is in fact no phase transition 
at any Uc > 0. Instead, the TSATW is collapsed for any 
M > 0, i.e. Uc = 0. This is also consistent vi^itli the 
fact that 2-d random walks are recurrent, i.e. the inter- 
action should be a relevant perturbation for any m > 0. 
It is difficult to obtain direct numerical evidence for this 
scenario, due to the very slow cross-over from the ran- 
dom walk behavior to the collapsed behavior, and due to 
the presence of strong corrections. In order to make any 
progress, we have to understand better these corrections. 

In order to analyze the behaviour for very small u more 
closely, let us define the quantity 

^tiu) = -^HR^/t] (9) 

It is obviously well defined for u ^ 0, but it can be defined 
also for u = using THopital's rule, 

1 1 r)R^ 

MO) = hm -ln[i??A] = (10) 

u-tO u t ou 

We used here the fact that Kl =t exactly for m = 0. Nu- 
merically, \E't(0) can be estimated by a slight generaliza- 
tion of the reduced variance method discussed in subsec- 
tion [UbJ We simulate just ordinary random walks, but 
keeping track of the visited sites and calculating dR^ jdu 
using Eqs. (2), (5), and (6). 

It is easily seen that ^'t(u) is positive for all u. Plots 
of 4't(u) versus both for positive and for negative val- 
ues of u, are shown in Fig. [51 Assume there is a collapse 
transition at u = Uf.. We then expect that ^t{u) di- 
verges as \nt for u > and i — > oo, while it should 
stay bounded for u < Uc- More precisely, we expect that 



'^tiu) ~ const — a/t^ for u < Uc, where S is another cor- 
rection to scaling exponent. Plotting ^'((u) versus t~^ 
should thus give straight lines converging to finite values 
for t^^ ^ if It < Uc, but upward bent curves diverging 
for t^^ if u > Uc- One such plot, showing (w) ver- 
sus is given in Fig. [HI From this and similar plots 
with different exponents, we conclude that (i) the data 
are consistent with this scenario; (ii) the critical point is 
at Uc ~ 0, most likely at Mc = exactly; (iii) the cor- 
rection to scaling exponent is (5 = 0.22 ± 0.05; and (iv) 
at the critical point, ^'t scales either as ^'t(O) ~ Inlni 
or ^'((0) ~ [Int]" with < a < 1. The former 
(^'4(0) ~ Inlni) seems preferred, but a clear distinction 
between these alternatives is not possible. 

Studying St, the number of visited sites, is not very 
revealing. As seen from Fig.fTUl there is no value of m for 
which the curve is straight, u 0.7, w = e"«2 yields 
the straightest curve in the large t range 10^ < t < 10*, 
but this is clearly not asymptotic, as the curves for larger 
u indicate (they have not crossed over to asymptotic be- 
havior and hence curve up for large t, although they fi- 
nally curve down, for very large t) . 

For coupling constants w 3> 1 one finds again that the 
prediction of the quasistatic approximation, Rt ^ t^^^, 
is in excellent agreement with the data (see Fig. [TT|) . As 
in the 3-d case, corrections to this prediction are very 
large for small values of u, but they decrease quickly for 
M — > 00. One would like of course to verify the quasistatic 
approximation for smaller u, but this seems at present 
impossible without going to lattice sizes beyond the reach 
of our normal computational resources. 



E. Bond TSATWs in d = 3 

We now turn to the bond reinforced random walk in 
d = 3. Results for the end-to-end distance R^ arc shown 
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FIG. 11: (color online) R^t^^'-^ for site TSATWs in 2 dimen- 
sions, for seven values of u which are all larger than Uc- The 
curves seem to become horizontal as t oo, with the asymp- 
totic behavior appearing for smaller values of t as u increases. 



in Fig. [121 The plot is superficially quite similar to Fig. [21 
with substantial corrections to scaling and the critical re- 
inforcement Uc occurring at much higher u. This latter 
fact is unsurprising as bond-reinforcement is much more 
"dilute" than its site-reinforced cousin; consider that the 
equivalent of a visited site in the bond-reinforced model 
must have all six bonds visited in d = 3. Analysis sug- 
gests Uc = 2.475 ± 0.003 and = 0.380 ± 0.004. Note 
that Vc is within error of the estimate Vc = 0.378 ± 0.004 
for the Site TSATW in d = 3. This is the first piece of 
evidence that the bond and site models are in the same 
universality class. 

In Fig. [T31 we show a data collapse with the same 
scaling ansatz Eq.®; Uc = 2.475, t^c = 0.380 and (j) = 
0.185 ±0.020. The critical exponents Vc and are within 
error and identical, resp., to those for the Site TSATW 
in d = 3, see Fig. [31 and the data collapse is if anything 
even better than that of Fig. [31 We see similar correc- 
tions to scaling at small t and excellent collapse at large 
t, facilitated by our ability to simulate long walks ( 10^) 
due to the high value of Uc- 

Results for the average number of visited sites, {St) 
are not shown, but are similar to Fig. [4l with substan- 
tial corrections to scaling. We thus use the estimate of 
Uc obtained from Rt, i.e. Uc = 2.475 ± 0.002. The best 
estimate of the exponent kc (again made difficult by cor- 
rections to scaling) is kc = 0.970 ± 0.010. This is incom- 
patible with the estimate 0.91 ± 0.01 of [ll| but within 
the error of our estimate for the site-reinforced model, 
kc = 0.977 ± 0.010. The leading correction to scaling 
exponent, defined as {St) = t^^la + b/t^ + o(i~^)], is 
found to be A = 0.25 ± 0.05. This is within error of the 
estimate A = 0.22 ± 0.03 in the site case. 

Hence in all cases the estimates of the critical ex- 
ponents for the site-reinforced model given by (ll| are 
excluded by our results as candidate exponents for the 



FIG. 12: (color online) Average squared end-to-end distances 
for bond TSAWs in 3 dimensions, divided by the number t of 
steps. Each curve corresponds to a fixed value of it, with u 
increasing from top to bottom. The critical value u = Uc = 
2.475 (wc ~ 11.88) corresponds to a straight curve in the limit 
t ^ oo whose slope is 2vc — 1. Here and in all subsequent 
figures, the curves are labelled hy w = exp(it). Statistical 
errors are comparable to the thickness of the curves. 
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FIG. 13: (color online) The same data as in Fig. 1121 plotted 
now as Rt/t^"" versus {u-Uc)t^, with Uc = 0.380, itc = 2.475, 
and (j) = 0.185. Note that the critical exponents are within 
error bars of those used in the data collapse of Fig. |3] 



bond-reinforced model. The estimates of all expo- 
nents for the bond-reinforced model agree (within er- 
ror) with those we obtained for the site-rcinforced case, 
see Sec. Ill CI It is thus unsurprising that Fig. [TH simi- 
larly verifies the quasistatic approximation Rt ^ with 
V = 0.25 for u > Uc in the large t limit. Crossover to the 
asymptotic behavior occurs at smaller < as u — > oo. It 
is harder to verify the large t limit for small u > Uc', we 
cannot run sufficiently long walks while limiting spurious 
self- intersection and hence reliably estimating Rt- 
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FIG. 14: (color online) Rl/Vt for bond TSAWs in 3 dimen- 
sions, for five values of u which are all larger than Uc. The 
curves seem to become horizontal ast^ oo, with the asymp- 
totic behavior appearing for smaller values of t as u increases. 
This verifies that the quasistatic holds for bond as well as site 
reinforcement. 
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FIG. 16: (color online) Attempted data collapse analogous to 
Fig. 1131 but for the data of Fig. 1151 Parameters in panel (a) 
are Va = 0.481, Uc = 0.730 {w^ = 2.075), and <t> = 0.058, while 
panel (b) uses the values Vc = 0.40, Mc = 0.88 {wc — 2.411), 
cj) = 0.2 proposed in [ll[. The latter collapse is completely 
unsatisfactory. 
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FIG. 15: (color online) Same as Fig. [121 but for d — 2. 



F. Bond TSATWs in d = 2 

Our results for d = 3 indicate that the bond- and site- 
reinforced models are in the same universality class. This 
implies that Wc = for bond TSATW in d = 2. To test 
this, we study small u walks, which will cross over to the 
collapsed behavior only for large values of t. This regime 
is even more difficult to study in the bond reinforced case, 
due to the dilute nature of the bond reinforcement. The 
simulations used are as large as possible {2^^ sites and 
walks of w 10^ steps) but in many cases these walks just 
reach the beginning of what may be the scaling regime. 

In Fig [15] we show results for the end-to-end distance 
Rt/t. As in the site-reinforced case for d = 2, the curves 
fan out in Fig. [TS] for small t, with no apparent cross- 



over of the sort seen in Fig. [6] or Fig. [121 An estimate 
of the critical reinforcement Uc from these data is ex- 
tremely difficult. An attempted data collapse for the 
data of Fig. [THl using the scaling ansatz Eq.([Sl) and op- 
timized values Uc — 0.73, 1'c = 0.481, and 4> — 0.058, is 
shown in Fig. [TCT a). While the data collapse accept- 
ably for these values, the exponents proposed in [ll| . 
Uc = 0.88, i^c = 0.40 ±0.01 and cj) « 0.2, can be ruled out, 
as a data collapse using these exponents is completely 
unsatisfactory. Fig. rTBT b) . 

As the estimated <j> = 0.058 is even smaller than that 
obtained for the site-reinforced model in d = 2 (where 
(j> = 0.085) and as i^c = 0.481 is very close to the ran- 
dom walk value ly = 1/2, we argue that there is no phase 
transition for Uc > in the bond-reinforced model, ei- 
ther. In particular, similar heuristic arguments about 
the recurrence of 2-d random walks suggest again that 
any non-zero reinforcement is a relevant perturbation. 
Hence we study again the function ^t{u) [Eq. ^] for 
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FIG. 17: (color online) The function "^itiu) [see Eq. ©] plot- 
ted against t for various positive and negative values of u, 
including u = 0. This figure again resembles a cross-over plot 
as in Fig. [S] suggesting that it = (u; = 1) is a critical point 
for bond-reinforcement as well. 
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FIG. 18: (color online) for bond TSAWs in 2 dimen- 

sions, for seven values of u which are all larger than Uc- The 
curves seem to become horizontal for t oo and u oo. 



M > 0, M = 0, and u < (see Fig. [T7]). As for the site 
case (subsection IIID|) . plotting ^((m) against with 
different exponents 6 reveals the detailed asymptotic be- 
havior. Such plots (not shown here) indicate that Uc = 0, 
and that the correction to scaling exponent in the uncol- 
lapsed phase u < Uc is S — 0.20 ± 0.05, well within error 
of the estimated 6 — 0.22 ± 0.05 of the site case. 

As was the case for the site-reinforced model in d = 2, 
it is not particularly illuminating to study the number 
of visited sites St- The corrections to scaling are even 
larger, and hence it is impossible to estimate the correct 
scaling exponent from these data. 

For coupling constants m 3> one finds that the pre- 
diction of the quasistatic approximation, Rt ^ is 
in excellent agreement with the data (see Fig. [T5)) . Cor- 
rections to this prediction are very large for small values 
of u, but become irrelevant as u —f oo, as is already 
apparent at w = e" = 16. Familiar limitations to ac- 
cessible lattice size make the quasistatic approximation 
impossible to verify for smaller u, where crossover to the 
asymptotic behavior takes place at very large t. Settling 
the validity of the quasistatic approximation in the small 
u regime will in all likelihood require the development 
and application of appropriate analytical methods. 



III. DISCUSSION AND OUTLOOK 

Despite its simplicity, the once-reinforced site variant 
of the True Self Attracting Walk (TSATW) has generated 
considerable controversy since its original statement by 
Sapozhnikov In this paper we have used a combi- 
nation of careful high-statistics simulations and heuristic 
arguments to attempt a resolution of many of these dis- 
putes. 



In = 3 we confirm the existence of a phase tran- 
sition from random walk-like to collapsed behavior for 
finite reinforcement u^,. Our simulations provide over- 
whelming evidence for rejecting the proposed Uc and scal- 
ing exponents of [13, El. We find Uc 1.831 ± 0.002 
and i^c = 0.378 ± 0.004, with the crossover exponent 
4> = 0.185 ± 0.020. In addition we verify the quasistatic 
approximation ^ ^/i for large t and u. 

In d = 2 we argue that there is no phase transition 
at any finite reinforcement Uc- For any m > the walks 
go to a collapsed phase, and the "critical behavior" at 
Mc = is simply that of a random walk. The quasistatic 
approximation i?f ~ t^/'^ is also verified for large t and 
u. 

In addition to the site-reinforced TSATW, we stud- 
ied the bond-reinforced variant and found evidence that 
despite the underlying mathematical differences (and re- 
lated difficulties) bond-reinforced TSATW is in the same 
universality class as site-reinforced TSATW. In d = 3 
we found a phase transition at finite reinforcement ~ 
2.475 ± .003 and scaling behavior extremely similar to 
the site-reinforced model, with similar success of the qua- 
sistatic approximation. In d = 2 we found evidence of a 
phase transition at Uc = although the evidence here is 
somewhat weaker due to the long time needed to cross 
over to the collapsed behavior and the memory limita- 
tions imposed by the extremely large lattices needed to 
minimize spurious self-intersection. 

An obvious limitation of our work is the lack of an an- 
alytical understanding as to why the transition to a col- 
lapsed phase should occur at any finite reinforcement in 
the site- and bond- models in d = 2. In the mathematics 
literature, the once-reinforced ERRW has been studied 
by mapping it to a diffusion with a drift term (directed 
inward) at the boundary [3, [23] . As far as we know this 
technique has only been applied in d = 1 and is even in 
this case of considerable technical difficulty. There are 



10 



also techniques mapping the stochastic process to a (de- 
terministic) dynamical system, the so-called "stochastic 
approximation" .which is largely unknown to the physics 
literature [l^, [lj| . This suggests that some sensible map 
to a continuous process or a dynamical system might en- 
able an analytic proof of Uc = in one or both of the 
variants of once- reinforced TSATW in d = 2. 

More generally, the universality result we propose for 
site and bond TSATW in d = 3 and d ~ 2 suggests 
the possibility of a deep dialogue between the statisti- 
cal physics and probability literatures. The perspective 
of statistical physics generates different questions (with 
respect to phase transitions, critical behavior, and uni- 
versality) that complement the rigorous results derived 
within the probability community. Furthermore, the 
probability literature as reviewed in contains an enor- 
mous number of unexplored models for random walks 
with reinforcement. It is also clear that these walk pro- 
cesses are specific instances of a general study of random 



processes with reinforcement, with many applications in 
the biological and social as well as physical sciences [l^ . 
The statistical physics of such models remains an almost 
entirely open question. 

We end on a cautionary note about the use of simu- 
lation in these problems. As pointed out by Pcmantle 
[l^ . the convergence times for some random processes 
with reinforcement can be astronomical; the Friedman 
urn, for example, does not reach its asymptotic behavior 
until a googol updates or more. This suggests that in 
some cases the high statistics simulations that would be 
applied by statistical physicists may only be probing the 
transient behavior of such models. While the transient 
behavior has its own intrinsic interest, we suggest that a 
dialogue between the two fields would do much to drive 
research in mutually beneficial directions-while avoiding 
pitfalls along the way. 
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